ON USING FEEDBACK IN A GAUSSIAN CHANNEL

A discrete-time channel with independent additive Gaussian noise is used for
information transmission. There is also another channel with independent
additive Gaussian noise (the feedback channel), and the transmitter observes
without delay all outputs of the forward channel via that channel. Transmission
of a nonexponential number of messages is considered (i.e. the transmission rate
equals zero), and the achievable decoding error exponent for such a combination
of channels is investigated. The transmission method strengthens the method used
earlier by the authors for the BSC and Gaussian channels. In particular, for small
feedback noise, it yields a gain of 33.3% (instead of the 23.6% obtained earlier in
the similar case of the Gaussian channel).


§ 1. Introduction and main result 

In this paper the results of [1] are strengthened and the proofs are simplified. We consider the
discrete-time channel with independent additive Gaussian noise, i.e. if $x = (x_1, \ldots, x_n)$ is
the input codeword then the received block $y = (y_1, \ldots, y_n)$ is

$$y_i = x_i + \xi_i, \quad i = 1, \ldots, n, \tag{1}$$

where $\xi = (\xi_1, \ldots, \xi_n)$ are independent $\mathcal{N}(0,1)$-Gaussian random variables, i.e.
$\mathbf{E}\xi_i = 0$, $\mathbf{E}\xi_i^2 = 1$. There is also a noisy feedback channel, and the transmitter
observes (without delay) all outputs $\{y_i\}$ of the forward channel via that noisy feedback channel:

$$z_i = y_i + \sigma\eta_i, \quad i = 1, \ldots, n, \tag{2}$$

where $\eta = (\eta_1, \ldots, \eta_n)$ are independent (and independent of $\xi$) $\mathcal{N}(0,1)$-Gaussian random
variables, i.e. $\mathbf{E}\eta_i = 0$, $\mathbf{E}\eta_i^2 = 1$. The value $\sigma > 0$, characterizing the feedback channel noise
intensity, is given. No coding is used in the feedback channel (i.e. the receiver simply
retransmits all received outputs to the transmitter). In other words, the feedback channel is
“passive”.
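
The channel model (1)-(2) can be illustrated with a minimal simulation sketch. The function name `simulate_channels` and the use of Python's standard `random` module are illustrative assumptions, not part of the paper.

```python
import random

def simulate_channels(x, sigma, rng=None):
    """Pass a codeword x through the forward channel (1) and the
    passive feedback channel (2). Hypothetical helper for illustration."""
    rng = rng or random.Random()
    # Forward channel: y_i = x_i + xi_i, with xi_i ~ N(0, 1).
    y = [x_i + rng.gauss(0.0, 1.0) for x_i in x]
    # Feedback channel: z_i = y_i + sigma * eta_i, with eta_i ~ N(0, 1)
    # drawn independently of the forward noise.
    z = [y_i + sigma * rng.gauss(0.0, 1.0) for y_i in y]
    return y, z
```

With `sigma = 0` the transmitter sees exactly what the receiver sees, which corresponds to the noiseless feedback case.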
We assume that the input block $x$ satisfies the constraint

$$\sum_{i=1}^{n} x_i^2 \le nA, \tag{3}$$

where $A$ is a given constant. We denote by AWGN($A$) the channel (1) with constraint (3)
without feedback, and by AWGN($A, \sigma$) that channel with noisy feedback (2). The capacity
of both channels equals $C(A) = [\ln(1 + A)]/2$.

We consider the case when the overall transmission time $n$ and $M = e^{o(n)}$, $n \to \infty$,
equiprobable messages $\{\theta_1, \ldots, \theta_M\}$ are given. After the moment $n$, the receiver makes a
decision $\hat{\theta}$ on the message transmitted. We are interested in the best possible decoding error
exponent (and in whether it exceeds the similar exponent of the channel without feedback).

It is well known [2] that even noiseless feedback does not increase the capacity of the
Gaussian channel (or of any other memoryless channel). However, feedback makes it possible
to improve the decoding error exponent (channel reliability function) with respect to the
no-feedback channel. The possibility of such an improvement stimulated considerable interest
in the topic in the 1960s-1980s, and a number of interesting results were obtained during that
period (e.g. [3-10]). Unfortunately, all those papers had a common drawback: their methods
relied heavily on the assumption that the feedback is noiseless. That assumption was needed to
maintain perfect coordination between the transmitter and the receiver. Essentially, any noise in
the feedback link destroyed that coordination and all hypothetical improvements. It was not
clear whether communication characteristics could be improved using more realistic
noisy feedback.

That uncertainty about noisy feedback remained until 2008, when it was shown in [11]-[14]
(for the BSC) how to use such feedback in order to improve the decoding error exponent.
Although the improvement was not large (approximately 14.3% for small feedback noise), it
was the first method that worked for noisy feedback. Later results ([1] and this paper) are
developments of [11], [12].

To explain what is new in this paper, we briefly recall what was done in the earlier
papers [11], [12] and [1]. For that purpose we first explain why noiseless feedback makes it
possible to improve the decoding error exponent. For a channel without feedback that exponent is
determined (for small transmission rates $R$) by the code distance of the code used (i.e. by the
minimal distance among codewords). Noiseless feedback makes it possible to change the code
(coding function) during transmission, e.g. by increasing the distances among the most probable
codewords. That feature improves the decoding error exponent, but it requires ideal
coordination between the transmitter and the receiver.

In all papers [11]-[14], [1] and in this one, the coding function can be changed only at one fixed
moment (the “switching moment”). In [11]-[14] such a change took place only if the two most probable
codewords were much more probable than all remaining codewords. It was shown that if the
noise in the feedback channel is less than a certain critical value $p_{\mathrm{crit}}$, then it is possible to
choose the transmission parameters such that the probability of miscoordination between the
transmitter and the receiver becomes smaller than the decoding error probability. That fact
made it possible to improve the decoding error exponent with respect to the no-feedback channel.
Later, in the paper [1], that method was strengthened for the Gaussian channel by taking into
account not two but three most probable codewords. Moreover, the decoding method was
improved. This made it possible not only to improve the gain (23.6% instead of 14.3% in [12]), but also
to show that for any noise intensity $\sigma^2 < \infty$ it is possible to improve the best error exponent
of the AWGN($A$) no-feedback channel. Of course, if $\sigma$ is not small then the gain is small, but
it is strictly positive. In other words, in the problem considered there is no critical
level $\sigma_{\mathrm{crit}}$ beyond which it is impossible to improve the error exponent of the no-feedback
channel. It should also be noted that the investigation method with optimal decoding in
[1] was rather tedious.

The method of the papers [11], [12] was applied to the Gaussian channel AWGN($A, \sigma$) in [15], with
results similar to [11], [12] (in particular, with the same asymptotic gain of 14.3%).

The aim of this paper is to strengthen the transmission method of [1] (in particular, by using
up to four most probable codewords) and also to simplify its analysis. This improves the
gain to 33.3% (instead of 23.6% in [1]).

Remark 1. We consider the case when the value $\sigma^2 > 0$ is fixed and does not depend on
the number of messages $M$.

For $x, y \in \mathbb{R}^n$ denote

$$(x, y) = \sum_{i=1}^{n} x_i y_i, \qquad \|x\|^2 = (x, x), \qquad d(x, y) = \|x - y\|^2.$$

A subset $\mathcal{C} = \{x_1, \ldots, x_M\}$ with $\|x_i\|^2 = An$, $i = 1, \ldots, M$, is called an $(M, A, n)$-code of
length $n$.

For a code $\mathcal{C} = \{x_i\}$ denote by $P_e(\mathcal{C})$ the minimal possible decoding error probability

$$P_e(\mathcal{C}) = \min \max_i P(e|x_i),$$

where $P(e|x_i)$ is the conditional decoding error probability provided $x_i$ was transmitted, and the
minimum is taken over all decoding methods (it will be convenient to denote the message
transmitted as $\theta_i$ and as $x_i$ as well).

In the paper we consider the case when $M = M_n \to \infty$, but $M_n = e^{o(n)}$ as $n \to \infty$ (it
corresponds to zero rate of transmission). For $M$ messages and the AWGN($A$) channel denote
by $P_e(M, A, n)$ the minimal possible decoding error probability for the best $(M, A, n)$-code
and introduce the exponent (in $n$) of that function [16]

$$E(A) = \limsup_{\substack{n \to \infty,\ M \to \infty \\ \ln M = o(n)}} \frac{1}{n} \ln \frac{1}{P_e(M, A, n)} = \frac{A}{4}. \tag{4}$$


Similarly, for the AWGN($A, \sigma$) channel with noisy feedback denote by $P_e(M, A, \sigma, n)$ the
minimal possible decoding error probability and introduce the function

$$F(A, \sigma) = \limsup_{\substack{n \to \infty,\ M \to \infty \\ \ln M = o(n)}} \frac{1}{n} \ln \frac{1}{P_e(M, A, \sigma, n)}.$$
It is also known that if $\sigma = 0$ (i.e. noiseless feedback) then [7]

$$F(A, 0) = \frac{A}{2}. \tag{5}$$

For the AWGN($A, \sigma$) channel denote by $F_1(A, \sigma)$ the best error exponent for the transmission
method with one switching moment described in §2. Then $F_1(A, \sigma) \le F(A, \sigma)$ for all $A, \sigma$.
The main result of the paper is as follows.

Theorem. Let $M \to \infty$ and $\ln M = o(n)$, $n \to \infty$. Then

$$F_1(A, \sigma) \ge \frac{A(1 - \sigma^2)}{3}. \tag{6}$$

For small $\sigma$ the formula (6) gives a 33.3% improvement with respect to the no-feedback
channel (see (4)). It is given in a simplified form oriented to small values of $\sigma$. A more
general formula (following from the results of §4) would be too bulky.
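
As a quick numerical illustration of the 33.3% figure, the sketch below compares the no-feedback exponent $A/4$ from (4) with the lower bound $A(1 - \sigma^2)/3$ from (6). The function names are hypothetical and serve only this illustration.

```python
def no_feedback_exponent(A):
    # Zero-rate error exponent of the AWGN(A) channel, formula (4).
    return A / 4.0

def one_switch_lower_bound(A, sigma):
    # Lower bound (6) for the method with one switching moment.
    return A * (1.0 - sigma ** 2) / 3.0

def relative_gain(A, sigma):
    # Relative improvement of the bound (6) over the no-feedback exponent (4).
    base = no_feedback_exponent(A)
    return (one_switch_lower_bound(A, sigma) - base) / base
```

For $\sigma \to 0$ the relative gain tends to $1/3$. In this simplified form the bound exceeds $A/4$ only while $\sigma^2 < 1/4$, which agrees with the remark that (6) is oriented to small values of $\sigma$.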

Remark 2. The method described in the paper and its analysis can be generalized to a slowly
growing number $N = N(\sigma)$ of switches. It allows one to prove the following result:

$$F_{N(\sigma)}(A, \sigma) = \frac{A(1 + o(\sigma))}{2}, \qquad \sigma \to 0. \tag{7}$$

In other words, for small $\sigma$ the formula (7) gives an improvement of 100% with respect to the
no-feedback channel (see (4)), and it coincides with the similar result (5) for noiseless feedback.
This will be done in another paper.

In §2 the transmission method with one switching moment and its decoding are described.
In §§3-4 its analysis is performed and the theorem is proved. The Greek letters $\xi, \eta, \zeta, \xi_i, \ldots$ in
the paper designate $\mathcal{N}(0,1)$-Gaussian random variables.


§ 2. Transmission/decoding method 


We use a transmission strategy with one fixed switching moment at which the code
used is changed. Denote $n_1 = n/2$ and partition the total transmission time $[1, n]$ into
two phases: $[1, n_1]$ (phase I) and $[n_1 + 1, n]$ (phase II). After the moment $n$ the receiver makes a
decision in favor of the most probable message $\theta_i$ (based on all signals received on $[1, n]$).

Each of the $M$ codewords $\{x_i\}$ of length $n$ has the form $x_i = (x_i', x_i'')$, where both $x_i'$ (to
be used on phase I) and $x_i''$ (to be used on phase II) have length $n_1$.

Similarly, the received block $y$ has the form $y = (y', y'')$, where $y'$ is the block received
on phase I and $y''$ is the block received on phase II. Denote by $z'$ the block received (by the
transmitter) on phase I. The first parts $\{x_i'\}$ of the codewords are fixed, while the second
parts depend on the block $z'$ received by the transmitter on phase I.

We set two positive constants $A_1, A_2$ such that

$$A_1 + A_2 = nA, \tag{8}$$

and denote

$$\beta = \frac{A_2}{A_1}. \tag{9}$$

Then $A = (1 + \beta)A_1/n$. At the end of the proof of the Theorem we set $\beta = 1/2$.
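
The energy split between the two phases can be sketched as follows, assuming $\beta$ denotes the ratio $A_2/A_1$ (which is what the relation $A = (1 + \beta)A_1/n$ together with (8) implies). The helper name `split_energy` is hypothetical.

```python
def split_energy(n, A, beta):
    """Split the total energy n*A into phase I energy A1 and
    phase II energy A2 = beta * A1, so that A1 + A2 = n*A."""
    A1 = n * A / (1.0 + beta)
    A2 = beta * A1
    return A1, A2
```

For example, with $\beta = 1/2$ two thirds of the total energy go to phase I and one third to phase II.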

Denoting

$$d_i = d(x_i', y') = \|y' - x_i'\|^2,$$

arrange the distances $\{d_i, i = 1, \ldots, M\}$ for the receiver after phase I in increasing order,
and denote

$$d^{(1)} = \min_i d_i < d^{(2)} < \ldots < d^{(M)} = \max_i d_i$$

(a tie has zero probability). Let also $x'^{(1)}, \ldots, x'^{(M)}$ be the corresponding ranking of the
codewords $\{x_i'\}$ after phase I for the receiver, i.e. $x'^{(1)}$ is the codeword closest to $y'$, etc.
Similarly, denoting

$$d_i^t = d(x_i', z') = \|z' - x_i'\|^2,$$

arrange the distances $\{d_i^t, i = 1, \ldots, M\}$ for the transmitter after phase I in increasing
order, denoting

$$d^{(1)t} = \min_i d_i^t < d^{(2)t} < \ldots < d^{(M)t} = \max_i d_i^t.$$

Let also $x'^{(1)t}, \ldots, x'^{(M)t}$ be the corresponding ranking of the codewords $\{x_i'\}$ after phase I for
the transmitter, i.e. $x'^{(1)t}$ is the codeword closest to $z'$, etc.
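
The ranking step can be sketched in a few lines. The same routine serves the receiver (applied to $y'$) and the transmitter (applied to $z'$); the function names are illustrative assumptions.

```python
def squared_distance(u, v):
    # d(u, v) = ||u - v||^2
    return sum((a - b) ** 2 for a, b in zip(u, v))

def rank_codewords(codebook, observed):
    """Return (order, sorted_distances): the codeword indices sorted by
    increasing squared distance to `observed`, and those distances."""
    dists = [squared_distance(c, observed) for c in codebook]
    order = sorted(range(len(codebook)), key=lambda i: dists[i])
    return order, [dists[i] for i in order]
```

Because the feedback noise perturbs $z'$ relative to $y'$, the two rankings may differ; this possible mismatch is what the decoder has to account for.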

Transmission method with one switching moment. We choose a set $\mathcal{K}$ of codes $\mathcal{C}$
which the transmitter may use on phase II. The code $\mathcal{C} \in \mathcal{K}$ used on phase II depends on the
received block $z'$. Based on $y'$, the receiver finds the probability distribution $P_r(\mathcal{C}|y')$, $\mathcal{C} \in \mathcal{K}$,
of the code $\mathcal{C}$ used by the transmitter on phase II, and uses that distribution for optimal
decoding. This is the crucial point of the whole method.

Transmission. In order to simplify the exposition it is sufficient to consider the case $M <
(n + 2)/2$. Then on both phases we are able to use orthogonal codes of length $n_1 = n/2$.
The case of arbitrary $M$ such that $M = e^{o(n)}$, $n \to \infty$, can be treated by replacing orthogonal
codes with “almost” equidistant codes. Then all calculations remain essentially the same (see
details in [1]).

Phase I. The transmitter uses the orthogonal code of $M$ codewords $\{x_i'\}$ of length $n_1$
such that $\|x_i'\|^2 = A_1$.

Phase II. We set nonnegative numbers $\tau_2$ and $\tau_3$. Based on the received block $z'$ and the
numbers $\tau_2, \tau_3$, the transmitter chooses the $k = k(z', \tau_2, \tau_3)$ most probable (for him) messages,
$k = 2, 3, 4$. Denote that set of messages by

$$S_k = \{x'^{(1)t}, \ldots, x'^{(k)t}\}, \quad k = 2, 3, 4. \tag{10}$$

The code length for phase II is partitioned into two parts: of length 3 for the selected
$k \in \{2, 3, 4\}$ messages and of length $n_1 - 3$ for the remaining $M - k$ messages, respectively. The
transmitter uses the following code $\mathcal{C}'' = \mathcal{C}''(z')$ with $\|x_j''\|^2 = A_2$, $j = 1, \ldots, M$.
1) If $d^{(3)t} - d^{(2)t} > 2A_1\tau_2$, then the transmitter selects the two most probable (for him)
messages $\theta_i, \theta_j$ (i.e. $k = 2$) and uses for them opposite codewords $x_i'' = -x_j''$ that have
nonzero coordinates only at time instant $n_1 + 1$.

For the remaining $M - 2$ messages $\{\theta_s\}$ the orthogonal code of $M - 2$ codewords $\{x_s''\}$ of
length $n_1 - 3$ is used. That code has zero components at time instants $n_1 + 1, n_1 + 2, n_1 + 3$,
and all its codewords $\{x_s''\}$ are orthogonal to the first two codewords $(x_i'', x_j'')$.

2) If $d^{(3)t} - d^{(2)t} \le 2A_1\tau_2$ and $d^{(4)t} - d^{(3)t} > 2A_1\tau_3$, then the transmitter selects the three most
probable (for him) messages $\theta_i, \theta_j, \theta_m$ (i.e. $k = 3$) and uses for them the 3-simplex code
occupying time instants $n_1 + 1, n_1 + 2$.

For the remaining $M - 3$ messages $\{\theta_s\}$ the orthogonal code of codewords $\{x_s''\}$ of length
$n_1 - 3$ is used. That code has zero components at time instants $n_1 + 1, n_1 + 2, n_1 + 3$, and
all its codewords $\{x_s''\}$ are orthogonal to the first three codewords $(x_i'', x_j'', x_m'')$.

3) If $d^{(3)t} - d^{(2)t} \le 2A_1\tau_2$ and $d^{(4)t} - d^{(3)t} \le 2A_1\tau_3$, then the transmitter selects the four most
probable (for him) messages $\theta_i, \theta_j, \theta_m, \theta_l$ (i.e. $k = 4$) and uses for them the 4-simplex code
occupying time instants $n_1 + 1, n_1 + 2, n_1 + 3$.

For the remaining $M - 4$ messages $\{\theta_s\}$ the orthogonal code of codewords $\{x_s''\}$ of length
$n_1 - 3$ is used. That code has zero components at time instants $n_1 + 1, n_1 + 2, n_1 + 3$, and
all its codewords $\{x_s''\}$ are orthogonal to the first four codewords $(x_i'', x_j'', x_m'', x_l'')$.
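
The three-way selection rule can be sketched as a small function. It assumes the reading of the rule in which $k$ is decided by the gaps $d^{(3)t} - d^{(2)t}$ and $d^{(4)t} - d^{(3)t}$ between the transmitter's sorted distances; the function name and argument layout are hypothetical.

```python
def select_group_size(sorted_dists_t, A1, tau2, tau3):
    """Choose k in {2, 3, 4} from the transmitter's sorted distances
    d^(1)t <= d^(2)t <= ... (at least four entries are required).
    Assumed reading of the rule: compare successive gaps with thresholds."""
    d2, d3, d4 = sorted_dists_t[1], sorted_dists_t[2], sorted_dists_t[3]
    if d3 - d2 > 2 * A1 * tau2:
        return 2   # third candidate is well separated: keep two messages
    if d4 - d3 > 2 * A1 * tau3:
        return 3   # fourth candidate is well separated: keep three messages
    return 4       # otherwise keep four messages
```

The design intent is to keep the selected group as small as possible while the true message still belongs to it with high probability.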

This transmission method strengthens the method used in [1], [11]-[14], where only two
or three messages were selected.

Note also that the set $S_k$ of selected messages should be such that with high probability
the true message $\theta_{\mathrm{true}} \in S_k$, while the number $k$ is as small as possible.

Remark 3. By introducing additional parameters $\tau_4, \ldots$, it is possible to strengthen the
method used, but this gives only a minor improvement of the results obtained. A much larger
improvement can be obtained by using an increasing number $N = N(\sigma)$ of switches (see Remark 2).

Decoding. Due to the noise in the feedback channel the receiver does not know exactly the
codewords $x'^{(1)t}, x'^{(2)t}, \ldots$ and therefore it does not know the code used on phase II. But based
on the received block $y'$ it may evaluate the probabilities of all possible codewords $x'^{(1)t}, x'^{(2)t}, \ldots$
and find the probabilities with which any code $\mathcal{C}''$ was used on phase II. This allows the
receiver, based on the full received block $y = (y', y'')$, to find the posterior probabilities $\{p(y|x_i)\}$
and to make a decision in favor of the most probable message $\theta_i$. Such full decoding is described in
detail in the next section.


§ 3. Full decoding and error probability $P_e$


Since $\|x_i\|^2 = An$, $i = 1, \ldots, M$, for the likelihood ratio we have

$$\ln \frac{p(y|x_i)}{p(y|x_j)} = (x_i - x_j, y).$$

If $x_{\mathrm{true}}$ is the true codeword then $y = x_{\mathrm{true}} + \xi$, and $\xi = (\xi', \xi'') = (\xi_1, \ldots, \xi_n)$, where all $\{\xi_i\}$
are independent $\mathcal{N}(0,1)$-Gaussian random variables. If $x_{\mathrm{true}} = x_1$, then

$$\ln \frac{p(y|x_i)}{p(y|x_1)} = (x_i - x_1, \xi) - \frac{1}{2}\|x_i - x_1\|^2$$

and

$$\ln \frac{p(y|x_3)}{p(y|x_2)} = (x_3 - x_2, \xi) + (x_3 - x_2, x_1),$$

where $(x, \xi)$ is an $\mathcal{N}(0, \|x\|^2)$-Gaussian random variable.

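For the decoding error probability $P_e$ we have

$$P_e \le \frac{1}{M} \sum_{k=1}^{M} P_{ek}, \tag{11}$$

where

$$P_{ek} = P\left\{\max_{i \ne k} \ln \frac{p(y|\theta_i)}{p(y|\theta_k)} \ge 0 \,\Big|\, \theta_k\right\}, \quad k = 1, \ldots, M. \tag{12}$$

Denote ($(x_1', x_i') = 0$, $i \ge 2$)

$$X_i = \ln \frac{p(y'|\theta_i)}{p(y'|\theta_1)} = (x_i' - x_1', y') = (x_i' - x_1', \xi') - A_1, \qquad Y_i = \ln \frac{p(y''|y', \theta_i)}{p(y''|y', \theta_1)}. \tag{13}$$

It is sufficient to investigate the value $P_{e1}$, for which we have from (12), (13)

$$P_{e1} = P\left\{\max_{i \ge 2}(X_i + Y_i) \ge 0 \,\Big|\, \theta_1\right\} \le \sum_{i \ge 2} P\{X_i + Y_i \ge 0 | \theta_1\} = \sum_{i \ge 2} \mathbf{E}_{y'} P\{X_i + Y_i \ge 0 | y', \theta_1\}. \tag{14}$$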

We can express the value $Y_i$ via $y'$ as follows. Since $x'' = x''(z')$ and $y'' = x_1'' + \xi''$, then


gYi = p (W^*) 
p{y"\y',0i) 


_ p(j/"l« , ,j/',*i) _ 

Z ly p (2/"|P,2/',<) 

-C4Z'|t/ ,e 1 ■ 1 i; , 


(15) 


where the second equality is based on the fact that the same code is used in both the
numerator and the denominator.

Remark 4. In order to apply the formula (15) it is necessary to know only the difference
$\|x_i'' - x_1''\|$ (depending on $z'$). We do not need to know the whole code used on phase II.
The selected group of messages of the code for phase II may consist of 2, 3 or 4 messages. For
example, 3 messages are selected if the 3 most probable messages are approximately equiprobable
and all remaining messages are well separated from them (in the metric $d_i^t$).

We develop the right-hand side of the formula (15). The difference $\|x_i'' - x_1''\|$ (depending
on $z'$) takes one of 4 possible values (defined by the partition groups to which those messages
belong on phase II). It is convenient to separate those cases. Note that for all codewords
of the $k$-simplex code we have

$$d_{ij} = \|x_i'' - x_j''\|^2 = 2A_2k/(k - 1), \quad i \ne j. \tag{16}$$

Then denote

$$\delta_k = 2A_2k/(k - 1), \quad k = 2, \ldots, K, \qquad \delta_0 = 2A_2.$$

In other words, $\delta_k$, $k \ge 2$, is the codeword distance for the $k$-simplex code, while $\delta_0$ is such
distance for the orthogonal code. If $x_i'', x_1''$ belong to the $k$-simplex code then

$$(x_1'', x_i'' - x_1'') = -\delta_k/2 = -A_2k/(k - 1), \quad k = 2, \ldots, K,$$

$$(x_1'', x_i'' - x_1'') = -A_2 = -\delta_0/2, \quad k = 0.$$

The difference $\|x_i'' - x_1''\|^2$ may take the values $2A_2$ (corresponding to $k = 0$) and $\delta_k$, $k = 2, 3, 4$.
Each value $\delta_k$, $k = 2, 3, 4$, appears for phase II if a group of $k$ messages was selected and both
messages $x_1', x_i'$ belong to that group. In all other cases the value $\delta_0$ is used.

Assuming $\theta_{\mathrm{true}} = \theta_1$, introduce non-overlapping sets of random events

$$Z_{i,k} = \{z': \|x_i'' - x_1''\|^2 = \delta_k\}, \quad k = 0, 2, 3, 4. \tag{17}$$

Denoting formally $Z_{i,1} = \emptyset$, $i \ge 2$, we have $\{z'\} = \sum_{k=0}^{4} Z_{i,k}$ (here $\sum$ means the union of
non-overlapping sets, and $\{z'\}$ is the set of all possible outputs $z'$).
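
The distance formula $2A_2k/(k - 1)$ for the $k$-simplex code can be checked numerically. The sketch below builds an equal-energy, equidistant set of $k$ points by centering the standard basis of $\mathbb{R}^k$; this particular construction is an illustrative assumption (the points span only $k - 1$ dimensions, matching the $k - 1$ time instants used in the text), and the function names are hypothetical.

```python
import math

def simplex_code(k, A2):
    """k equidistant codewords of energy A2. Coordinates are given in R^k,
    but the points lie in a (k-1)-dimensional subspace."""
    scale = math.sqrt(A2 / (1.0 - 1.0 / k))
    return [[scale * ((1.0 if i == j else 0.0) - 1.0 / k) for j in range(k)]
            for i in range(k)]

def delta(k, A2):
    # Squared distance between distinct codewords of the k-simplex code.
    return 2.0 * A2 * k / (k - 1)
```

For $k = 2, 3, 4$ this gives $4A_2$, $3A_2$ and $8A_2/3$, each larger than the value $2A_2$ of the orthogonal code.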

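We may continue (15) as follows:

$$e^{Y_i} = \sum_{k=0}^{4} \mathbf{E}\left[e^{(x_1'', x_i'' - x_1'') + (\xi'', x_i'' - x_1'')}; Z_{i,k} \,\Big|\, y'\right] = \sum_{k=0}^{4} p_k e^{-\delta_k/2 + (\xi'', x_i'' - x_1'')},$$

where $\mathbf{E}[\xi; A] = \mathbf{E}(\xi \cdot I\{A\})$, $p_1 = 0$ and

$$p_k = p_k(y') = p_k(\xi') = P(Z_{i,k}|y'), \quad k = 0, 2, 3, 4. \tag{18}$$

Then using (13) we have

$$e^{X_i + Y_i} = \sum_{k=0}^{4} p_k e^{-A_1 - \delta_k/2 + (x_i' - x_1', \xi') + (\xi'', x_i'' - x_1'')},$$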
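and therefore (since $k$ takes one of four possible values)

$$\begin{aligned}
P\{X_i + Y_i \ge 0 | \theta_1\} &= \mathbf{E} P\{e^{X_i + Y_i} \ge 1 | y', \theta_1\} = \\
&= \mathbf{E} P\Big\{\sum_{k=0}^{4} p_k e^{-A_1 - \delta_k/2 + (x_i' - x_1', \xi') + (x_i'' - x_1'', \xi'')} \ge 1 \,\Big|\, y', \theta_1\Big\} \le \\
&\le \sum_{k=0}^{4} \mathbf{E} P\Big\{\big[p_k e^{-A_1 - \delta_k/2 + (x_i' - x_1', \xi') + (x_i'' - x_1'', \xi'')} \ge 1/4\big] \cap Z_{i,k} \,\Big|\, y', \theta_1\Big\} = \\
&= \sum_{k=0}^{4} P\Big\{\big[(x_i' - x_1', \xi') + (x_i'' - x_1'', \xi'') + \ln p_k(y') \ge A_1 + \delta_k/2 - \ln 4\big] \cap Z_{i,k} \,\Big|\, \theta_1\Big\},
\end{aligned} \tag{19}$$

where $\|x_i'' - x_1''\|^2 = \delta_k$ for the set $Z_{i,k}$. Denote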


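$$(x_i', \xi') = \sqrt{A_1}\,\xi_i', \qquad (x_i', \eta') = \sqrt{A_1}\,\eta_i', \qquad i = 1, \ldots, M, \tag{20}$$

where all $\{\xi_i', \eta_i'\}$ are independent $\mathcal{N}(0,1)$-Gaussian random variables.
Since $(x_i'' - x_1'', \xi'') \sim \sqrt{\delta_k}\,\xi''$ for the set $Z_{i,k}$, we get from (19) and (20)

$$P\{X_i + Y_i \ge 0 | \theta_1\} \le e^{o(1)} \sum_{k=0}^{4} P_{ik}, \qquad P_{ik} = P\left\{\sqrt{A_1}(\xi_i' - \xi_1') + \sqrt{\delta_k}\,\xi'' + \ln p_k(\xi') \ge A_1 + \delta_k/2\right\}, \tag{21}$$

where $\xi''$ does not depend on $\xi'$ and $o(1) \to 0$ as $A_1 \to \infty$.

The probabilities $\{p_k(\xi')\}$ and the values $P_{ik}$ from (21) are evaluated in the next section.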

§ 4. Probabilities $p_k(\xi')$ and values $P_{ik}$. Proof of Theorem

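Let $\xi$ be an $\mathcal{N}(0,1)$-Gaussian random variable. We will regularly use the simple inequality

$$P(\xi \ge z) = \frac{1}{\sqrt{2\pi}} \int_z^{\infty} e^{-u^2/2}\,du \le e^{-z_+^2/2}, \quad z \in \mathbb{R}^1, \tag{22}$$

where $z_+ = \max\{z, 0\}$, and its natural generalization.

Lemma 1. 1) Let $(\xi_1, \ldots, \xi_K)$ be independent $\mathcal{N}(0,1)$-Gaussian random variables and
$\mathcal{A} \subseteq \mathbb{R}^K$. Then ($x = (x_1, \ldots, x_K)$, $\|x\|^2 = x_1^2 + \ldots + x_K^2$)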


P((£i,---,60 eA)< exp <( --ndj\x\\ 2 


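(23)

2) Let $\xi, \eta$ be $\mathcal{N}(0,1)$-Gaussian random variables and $\mathbf{E}(\xi\eta) = \rho$. Then
a) if $A - B\rho > 0$ and $B - A\rho > 0$ then

$$P(\xi \ge A, \eta \ge B) \le P\left(\xi \ge \sqrt{\frac{A^2 + B^2 - 2\rho AB}{1 - \rho^2}}\right); \tag{24}$$

b) otherwise

$$P(\xi \ge A, \eta \ge B) \le \min\{P(\xi \ge A), P(\eta \ge B)\}. \tag{25}$$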


Proof. 1) Let $\inf_{x \in \mathcal{A}} \|x\| = r > 0$. Then $\mathcal{A} \subseteq \mathbb{R}^K \setminus S(r)$, where $S(r)$ is the ball of radius
$r$. Therefore

$$P((\xi_1, \ldots, \xi_K) \in \mathcal{A}) \le P\{(\xi_1, \ldots, \xi_K) \in \mathbb{R}^K \setminus S(r)\}.$$

Evaluating the last probability (using spherical coordinates) we get the formula (23).

2) We have

$$P(\xi \ge A, \eta \ge B) \le \inf_{a \ge 0} P(\xi + a\eta \ge A + aB) = \inf_{a \ge 0} P\left(\xi \ge \frac{A + aB}{\sqrt{1 + a^2 + 2a\rho}}\right).$$

Minimizing the last expression over $a \ge 0$, we get the formulas (24)-(25). □
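
The minimization over $a \ge 0$ in part 2) can be checked numerically. Setting the derivative of $(A + aB)^2/(1 + a^2 + 2a\rho)$ to zero gives $a^* = (B - A\rho)/(A - B\rho)$, which is positive exactly under the conditions of case a). The sketch below, with hypothetical function names, compares the closed-form threshold with a direct evaluation.

```python
import math

def lemma_threshold(A, B, rho):
    """Threshold t such that the bound reads P(xi >= A, eta >= B) <= P(zeta >= t).
    Case a) uses the closed form; case b) reduces to max(A, B)."""
    if A - B * rho > 0 and B - A * rho > 0:
        return math.sqrt((A * A + B * B - 2 * rho * A * B) / (1 - rho * rho))
    return max(A, B)

def combination_threshold(A, B, rho, a):
    # Normalized threshold of the event {xi + a*eta >= A + a*B}.
    return (A + a * B) / math.sqrt(1 + a * a + 2 * a * rho)
```

A larger threshold means a smaller probability, so the best choice of $a$ is the one that maximizes `combination_threshold`.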

Inequalities (23)-(25) give the exact logarithmic asymptotics in a natural asymptotic
case.

In order to apply the formula (21), we consider sequentially the cases $k = 2, 0, 3, 4$.

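1. Case $k = 2$, $\delta_2 = 4A_2$. It is the simplest case and it takes place with probability close
to 1. In that case $x_1'', x_i''$ compose the group $S_2$ of two selected messages. Neglecting the term
$\ln p_2 \le 0$ we get from (21), (22)

$$\begin{aligned}
P_{i2} &\le P\left\{(x_i' - x_1', \xi') + 2\sqrt{A_2}\,\xi'' \ge A_1 + 2A_2 - \ln 3\right\} = \\
&= P\left\{\sqrt{2A_1 + 4A_2}\,\xi \ge A_1 + 2A_2 - \ln 3\right\} \le \exp\left\{-\frac{(A_1 + 2A_2 - \ln 3)_+^2}{4(A_1 + 2A_2)}\right\} \le \\
&\le \sqrt{3}\,e^{-(A_1 + 2A_2)/4} = \sqrt{3}\,e^{-A_1(1 + 2\beta)/4}.
\end{aligned} \tag{26}$$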
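
The last step of (26), where the $\ln 3$ shift is absorbed into the prefactor $\sqrt{3}$, can be verified numerically. The sketch below assumes $A_2 = \beta A_1$ and the tail bound (22); the function names are hypothetical.

```python
import math

def case2_tail_bound(A1, beta):
    # exp{-(a - ln 3)_+^2 / (4a)} with a = A1 + 2*A2 = A1*(1 + 2*beta).
    a = A1 * (1 + 2 * beta)
    return math.exp(-max(a - math.log(3.0), 0.0) ** 2 / (4 * a))

def case2_simplified_bound(A1, beta):
    # sqrt(3) * exp{-A1*(1 + 2*beta)/4}, the right-hand side of (26).
    return math.sqrt(3.0) * math.exp(-A1 * (1 + 2 * beta) / 4)
```

Expanding the square shows why this works: $(a - \ln 3)^2/(4a) = a/4 - (\ln 3)/2 + (\ln 3)^2/(4a)$, and dropping the last nonnegative term leaves exactly the factor $e^{(\ln 3)/2} = \sqrt{3}$.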


The cases $k \ne 2$ are more computationally involved, and in order to investigate them we will
need the definition (10).

2. Case $k = 0$, $\delta_0 = 2A_2$. This is the most computationally involved case. It takes place
when the selected group of messages $S_m$ contains not more than one of the messages $x_1', x_i'$.
Then

$$P_{i0} = \sum_{m=2}^{4} P_{i0m}, \tag{27}$$

where $P_{i0m} = P\{k = 0, S_m\}$, $m = 2, 3, 4$. We consider sequentially the probabilities $\{P_{i0m}, m =
2, 3, 4\}$, starting with $P_{i02}$. Denote

$$d'_{ij} = \|x_i' - x_j'\|^2.$$

If $\theta_{\mathrm{true}} = \theta_1$ then the following formulas hold:

$$\begin{aligned}
d_i - d_j &= d'_{1i} - d'_{1j} + 2(x_j' - x_i', \xi'), \quad i, j = 1, \ldots, M, \\
d_i - d_1 &= d'_{1i} + 2(x_1' - x_i', \xi'), \\
d_i^t - d_j^t &= d'_{1i} - d'_{1j} + 2(x_j' - x_i', \xi' + \sigma\eta') = d_i - d_j + 2\sigma(x_j' - x_i', \eta'), \\
d_i^t - d_1^t &= d'_{1i} + 2(x_1' - x_i', \xi' + \sigma\eta') = d_i - d_1 + 2\sigma(x_1' - x_i', \eta').
\end{aligned} \tag{28}$$

If, in particular, $x'^{(1)t} = x_1'$, $x'^{(2)t} = x_2'$ and $x'^{(i)t} = x_i'$, $i \ge 3$, then in the case $S_2$ it is
necessary to have

$$d_i^t - d_2^t = 2(x_2' - x_i', \xi' + \sigma\eta') > 2A_1\tau_2. \tag{29}$$






In order to evaluate $p_0 = p_0(\xi')$ from (18) notice that the main contribution to $p_0$ comes from
the case when the true message $x_1'$ is selected, but the message $x_i'$ is not. Moreover, in the
case $S_2$ the maximum of $p_0$ is attained when $x'^{(1)t} = x_1'$, $x_i' \notin \{x'^{(1)t}, x'^{(2)t}\}$. Taking into account
the symmetry of the orthogonal code $\{x_j'\}$, we may assume that $x'^{(2)t} = x_2'$ and $x'^{(i)t} = x_i'$,
$i \ge 3$. Since there are not more than $M^3$ variants of arranging the messages $x_1', x_2', x_i'$, then
using (29) we have

$$\begin{aligned}
p_0(\xi') &\le M^3 P\{(x_2' - x_i', \xi' + \sigma\eta') > A_1\tau_2 \,|\, \xi'\} \le \\
&\le M^3 P\{\sigma(x_2' - x_i', \eta') > A_1\tau_2 + (x_i' - x_2', \xi') \,|\, \xi'\} \le \\
&\le M^3 \exp\left\{-\frac{(\sqrt{A_1}\,\tau_2 - \xi_2' + \xi_i')_+^2}{4\sigma^2}\right\}.
\end{aligned} \tag{30}$$

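Since $M = e^{o(A_1)}$, $A_1 \to \infty$ (see (8)-(9)), then from (21) and (30) for $P_{i02}$ we get

$$\begin{aligned}
P_{i02} &\le e^{o(A_1)} P\left\{\sqrt{A_1}(\xi_i' - \xi_1') + \sqrt{2A_2}\,\xi'' - \frac{(\sqrt{A_1}\,\tau_2 - \xi_2' + \xi_i')_+^2}{4\sigma^2} \ge A_1 + A_2\right\} = \\
&= e^{o(A_1)} P\left\{\sqrt{\frac{A_1}{2}}\left(\sqrt{3 + 4\beta}\,\zeta + \zeta_2\right) - \frac{(\sqrt{A_1}\,\tau_2 + \sqrt{2}\,\zeta_2)_+^2}{4\sigma^2} \ge A_1(1 + \beta)\right\},
\end{aligned} \tag{31}$$

where we used the representations $\xi_i' - \xi_2' = \sqrt{2}\,\zeta_2$, $\xi_i' - \xi_1' = \zeta_2/\sqrt{2} + \sqrt{3/2}\,\xi$, $\xi \perp \zeta_2$ and
$\sqrt{2A_2}\,\xi'' + \sqrt{3A_1/2}\,\xi = \sqrt{(3 + 4\beta)A_1/2}\,\zeta$, $\zeta \perp \zeta_2$ (here $\xi \perp \zeta$ means that the Gaussian random
variables $\xi, \zeta$ are orthogonal, i.e. independent).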

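Denoting $x\sqrt{A_1} = \zeta$, $y\sqrt{A_1} = \zeta_2$ and using the formula (23) we have from (31)

$$-2\ln P_{i02} \ge A_1 \inf_{(x,y) \in \mathcal{A}} (x^2 + y^2) + o(A_1),$$

$$\mathcal{A} = \left\{x, y: \sqrt{3 + 4\beta}\,x + y - \frac{(\tau_2/\sqrt{2} + y)_+^2}{\sigma^2\sqrt{2}} \ge (1 + \beta)\sqrt{2}\right\}. \tag{32}$$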

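Denoting $\mathcal{A}_1 = \{y: y \le -\tau_2/\sqrt{2}\}$, first we have

$$\inf_{(x,y) \in (\mathcal{A} \cap \mathcal{A}_1)} (x^2 + y^2) = \inf_{\substack{\sqrt{3 + 4\beta}\,x + y \ge (1 + \beta)\sqrt{2} \\ y \le -\tau_2/\sqrt{2}}} (x^2 + y^2) = \inf_{\substack{\sqrt{3 + 4\beta}\,x + y \ge (1 + \beta)\sqrt{2} \\ y = -\tau_2/\sqrt{2}}} (x^2 + y^2),$$

i.e. the infimum is attained on the border of $\mathcal{A}_1$. Therefore we may assume that $\tau_2/\sqrt{2} + y \ge 0$,
omit the sign of the positive part and replace (32) by

$$-2\ln P_{i02} \ge A_1 \inf_{(x,y) \in \mathcal{A}_2} (x^2 + y^2) + o(A_1), \qquad \mathcal{A}_2 = \{x, y: x - \varepsilon(y + a)^2 \ge B\}, \tag{33}$$

where

$$\varepsilon = \frac{1}{\sigma^2\sqrt{2(3 + 4\beta)}}, \qquad a = \frac{\tau_2 - \sigma^2}{\sqrt{2}}, \qquad B = \sqrt{\frac{2}{3 + 4\beta}}\left[1 + \beta + \frac{2\tau_2 - \sigma^2}{4}\right].$$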

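If $B > 0$, then for optimal $x, y$ we need $x - \varepsilon(y + a)^2 = B$ and therefore (omitting $\varepsilon^2(y + a)^4$)

$$\begin{aligned}
\inf_{(x,y) \in \mathcal{A}_2} (x^2 + y^2) &= \inf_y \left\{[\varepsilon(y + a)^2 + B]^2 + y^2\right\} \ge \\
&\ge \inf_y \left\{2B\varepsilon(y + a)^2 + B^2 + y^2\right\} = B^2 + a^2 - \frac{a^2}{1 + 2B\varepsilon} \ge B^2 + a^2 - \frac{a^2}{2B\varepsilon}.
\end{aligned}$$

Therefore we get from (33) as $A_1 \to \infty$

$$-\ln P_{i02} \ge \frac{A_1}{2}\left\{\frac{(4 + 4\beta + 2\tau_2 - \sigma^2)^2}{8(3 + 4\beta)} + \frac{(\tau_2 - \sigma^2)^2}{2} - \frac{(\tau_2 - \sigma^2)^2(3 + 4\beta)\sigma^2}{4 + 4\beta + 2\tau_2 - \sigma^2} + o(1)\right\}. \tag{34}$$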

We consider below only $\sigma^2 < 1$ and $\tau_2 < 4/9$. Then the formula (34) can be simplified as
follows:


hr Pm + —— 


Ai ( (2 + 2/3 + t 2 ) 2 u 


2(3 + 4/3) 


+ f-° r ~ 


2 + 2/3 + r 2 
2(3+ 4/3) 


+ T 2 + To 


+0(1) > 


> 




(2 + 2/3 + r 2 )^ 
3 + 4/3 


+ r 2 + °(1) 


(1 - a 2 ) = 


(35) 


A\{1 + /3)(1 + /3 + r 2 + t 2 ) (1 — a 2 ) 
3 + 4/3 


+ o(Ai), 


since 


(2 + 2/3 + t 2 ) 2 | r| ^ 2 + 2/3 + r 2 ( i 2 

+ T-+3T+T 21 


r 2 < 4/9. 


2(3+ 4/3) 

Consider the case of $S_3$ and $P_{i03}$. Again, the main contribution to $p_0$ and $P_{i03}$ comes from the case
when the true message $x_1'$ is selected, but $x_i'$ is not. The maximum of $p_0$ and $P_{i03}$ is attained when
$x'^{(1)t} = x_1'$, $x'^{(2)t} = x_2'$, $x'^{(3)t} = x_3'$ and $i \ge 4$. Moreover, we need $d_1^t < d_2^t < d_3^t < d_i^t$ and
$d_i^t > d_3^t + 2A_1\tau_3$. Then neglecting $\tau_2$, similarly to (30)-(31) we have

Po(£') < +d 4 P ((®' 2 - x[, £' + ot/) > Axt 3 , (x'i ~ x\, £' + arj') > A,t 3 {£'}) < 
< M 4 P ((x' 2 + x' 3 - 2x', g + err/') > 2+^3 {£'}) = 

= M 4 P (a(4 + ^ - 2*;, V) > 2A3T3 + 2y/A^ - y/A^ - VA^ 3 \{Q) < 

(2y/A^T 3 — £ 2 — £3 + 2£') 2 


< M 4 exp 


6cr 2 
and therefore (here $\xi, \zeta$ are independent $\mathcal{N}(0,1)$-Gaussian random variables)


Pm < e o(Al) P - Q + y/2A 2 ? 


> A\ + A 2 c + 


(^ VAiT 3 — £2 ~ £3 + 2 £-) 

6<J 2 

< e»«p{\/2+/3< + +V+ - v+fi + AMC - (2 ^ g + AAL > J 4 1 + /1 2 | — 

= e o(/ll) P | AA( + AAA + 3,4 2 ){ - AA/PPIjti± A± > ^3(4, + A 2 ) J , 

where we used the representations 2£' — £2 — £3 = \/6C £) = a/ 2/3 £ + Ci/\/3, C-LCi and 
similar ones. 

Denoting y\[A\ = £, xy/Ai = / and using the formula ([23]) , similarly to (1321) we have 

-2 In P,;o3 > inf (x 2 + c/ 2 ) + o(Ai), 

(x,y)eA 

A = {x, y : x — e{y + a) 2 > P} , 

where we omitted the sign of the positive part (similarly to (1331) ') and where 

6(1 + /3) + 4t 3 - cr 2 


2t 3 - cr 2 


£ = 


a = 


B = 


2 4/6(2 + 3/3) 


a 2 y 2(2 + 3/3) ’ /6 ’ 

Now, similarly to the case $P_{i02}$, we get as $A_1 \to \infty$

~ In P0.3 > 

Ai f [6(1 + ft) +4r 3 - cr 2 ] 2 (2r 3 - cr 2 ) 2 (2t 3 - q 2 ) 2 (2 + 3/3)cr 2 

8(2 + 3/3) 2 6(1 + /3) + 4r 3 - a 2 ° U 


> 


6 V 


(36) 


For $\sigma^2 < 1$ the formula (36) can be simplified as follows:

$$-\ln P_{i03} \ge \frac{A_1(1 + \beta)(1 - \sigma^2)}{4(2 + 3\beta)}\left[2 + 3\beta + (1 + 2\tau_3)^2\right]. \tag{37}$$

Consider $P_{i04}$. The maximum of $p_0$ and $P_{i04}$ is attained when $x'^{(j)t} = x_j'$, $j = 1, \ldots, 4$, and
$i \ge 5$. Moreover, we need $d_1^t < d_2^t < d_3^t < d_4^t < d_i^t$. Then for any $\sigma$ and $\beta \le 1/2$

Pm < e o(Ai) p {^(Z - ei) + >7i, + a 2 , / > c e 3 > & e; >/} < 

< e 0 ^ 1 ^P y/AA[ + y/Ai + 2A 2 £ > ^4i + A 2 , £2 > /3/ ^ < 

< e o(j4l) min P { y/4/1 - oa/3)^ + ay/A^^ 2 + \/4i + 2A 2 £ > Ai + A 2 ) = 

a> 0 L J 

= n un P | \J /li[(l — aV 3) 2 + a 2 + 1] + 2/1 2 £ > Ai + /t 2 | < 

< e°^dp 175^/4 + 2/12^ > Ai + A 2 } < 


(38) 


< exp 


. 2(1+/3)2Al + oIAa) < e-( 1 +« A i/3+ 0 (^). 
5 + 8/3 v - 
Therefore for $\sigma^2 < 1$ and $\tau_2 < 4/9$ we get from (35), (37) and (38)

$$-\ln P_{i0} \ge \frac{(1 + \beta)A_1}{4}\left[1 + \min\left\{\frac{(1 + 2\tau_2)^2}{3 + 4\beta}, \frac{(1 + 2\tau_3)^2}{2 + 3\beta}\right\}\right](1 - \sigma^2) + o(A_1). \tag{39}$$


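Note that if $\tau_2, \tau_3$ satisfy the conditions

$$\tau_2 \ge \frac{1}{\sqrt{15} + 3} \approx 0.1455, \qquad \tau_3 \ge \frac{1}{2(\sqrt{42} + 6)} \approx 0.04006, \tag{40}$$

then for any $\beta \le 1/2$ the formula (39) takes the form

$$-\ln P_{i0} \ge \frac{A_1(1 + \beta)(1 - \sigma^2)}{3} + o(A_1). \tag{41}$$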


3. Case k = 3. S 2 = 3A 2 . This case takes place if the group S’’ of three messages was 
selected and x \, x\ e S 3 . Then \\x" — x'[\\ = 3A 2 . Main contribution to 'P:i(y') and Pi 3 is 

given by case [x \, a/} = ja:'^/ a^ 2 ^|. Moreover, since we are interested in the probability 
P {Xj + Fj > 0|y', # 1 } and \\x” — tc"|[/v4 2 = 3 > || x\ — a^U/Ai = 2, then we may assume 
that > <t 2 . More exactly, without loss of generality, we may assume that i — 2 and 
x'i = x' 2 = x'^ t , x\ = x'^ 1 , *3 = x'^ 1 . Then first we have 


Pi 3 < M 3 P{di - d 2 > 0, df - df < 2A 1 t 2 \9 1 } < 

< M 3 P{( x 2 — x^, £') + y/ 3/1 2 ^" > Ai + 3A 2 /2, (x ' 3 — a;^, ^ + err/ 7 ) > Ai(l — r 2 )|$i} < 
= M 3 P{£' - £ + > VA~i(l + 3/3/2), ^ > ^4/(1 - r 2 )}. 


Since & - £ + \/3/3£" ~ V 2 + 3/3 f and - a\/2-// ~ y/2(l + a 2 ) C, where f, C - 

A/"(0,1)-Gaussian random variables with E(££) = —1/>/2(2 + 3/3)(l + a 2 ), then using the 
inequalities (l24ll and ([22]) . we get as Ai —* 00 


ki P%i > — InP £ > -a/(2 + 3/3)Al, rj > (1 — r 2 )\/4Li/[2(l + cr 2 )] + o{A\) > 


> 


Ai 


2 + 3/3 (1 — t 2 ) + (1 — r 2 )^ 


> 


kll(l + /3) 


1 + 


p 


1 + cr 2 


+ 


+ o(Ai) > 


2 — 3r 2 


2(1+ /3) (l + /3)(l + a 2 ) 

We limit ourselves only to the values $\beta \le 1/2$, $\tau_2 < 1/3$, $\sigma^2 < 1$. Then

4fi(l + P) 


+ o(Ai). 


- In Pi 3 > 


+ 0 (^ 3 ). 


(42) 
4. Case $k = 4$, $\delta_4 = 8A_2/3$. Similarly to $S_3$, the maximum of $p_0$ and $P_{i4}$ is attained when
x= x'^x' 1 - 2 ^ = x[, x'W = x' 3 , x= x\. Moreover, we need df^ < d { p < d® < d® 
and d^p — dp < 2 Ait 2) dP — dP < 2 Ait 3 . Then we have 

P % ± < M 4 P{(*' 2 - ®i, £') + V8 A 2 /3C >A 4 + 4A 2 /3, 

(*3 - X 1>£ + °V) > ^1(1 - r 2 ), (*4 - x’ 3 ,g + arj') > -A^} = 

= M 4 P|^2 - V + VWJ3C > VA~i(l + 4/3/3), 

£3 - £1 + Vha - hi) > V^(! - + 0 > v - £3 + <7(774 - ha) > ~ V^Ps} = 

= M 4 P{ Vl + 8/3/36 - V > y/A~i(l + 4/3/3), 

\/l + <7 2 f 3 - £[ - o+i > y/^ii 1 - t 2 ), Vl + <r 2 (f 4 - £ 3 ) > - V^4/r 3 | 
and therefore 

—2 In Pj 4 > Ai min ||z|| 2 + o{A 1 ), 2 = (z u .. ,,z 5 ), 

ZeA 

A = jz : y/l + 8/3/3 z 2 - Zi > 1 + 4/3/3, \/l + a 2 £ 3 — - crz 5 > 1 - r 2 , 

\/l + <T 2 (Z4 - Z 3 ) > -T 3 |. 


Minimum is attained when there are equalities in all three inequalities. Then 
^4 = £3 - 7 ==, <7Z 5 = VTT^, 3 - Z! ~ 1 + T 2 , 

VI + cr 

and after standard algebra we get 


min \\z\\ = min < z-, + 


,2 (3gi + 3 + 4/3) 2 hi + (ha - r 3 ) 2 ^ (h3 ~ + ~ 1 + r 2 ) 


ze.4 


21. 2/3 


3(3 + 8/3) 


1 + < 7 - 


+ 


c+ 


= (1 — T 2 ) 2 + 


2_2 


+ 


er"r. 


1 - r 2 


3t 2 + 4/3 r 3 


3 + 8/3 1 + 3a 2 


(3r 2 + 4/3) ; 

3(3 + 8/3) ^ (1 + <r 2 )(l + 3<r 2 ) _ ~ 3 | 1 | l+V 2 “ 


> 


3 + 8/3 a 2 <r 2 (l + 3<r 2 ) 


>(l-r 2 ) 2 + 


2 (3r 2 + 4/3)" 


3(3 + 8/3) 


cr 


1 - r 2 


3t 2 + 4/3 r 3 


1 2 


3 + 8/3 1 + 3a 2 


> 


> 


(1 _ T2 )2 + ( 3r 2 + W 


3(3 + 8/3) 
since for r 2 + r 3 < 1 we have 

(l-r 2 ) 2 > 


2 (3 + 4/3) (3 + 4/3 — 6t 2 + 6rf )(1 — cr 2 ) 

[ a } ~ 3(3 + 8/3) 


1 - r 2 - 


3t 2 + 4/3 r 3 


3 + 8/3 1 + 3<r 2 
Therefore if $\tau_2 + \tau_3 \le 1$, then

$$-\ln P_{i4} \ge A_1 f_4(\beta, \tau_2)(1 - \sigma^2) + o(A_1), \qquad f_4(\beta, \tau_2) = \frac{(3 + 4\beta)(3 + 4\beta - 6\tau_2 + 6\tau_2^2)}{6(3 + 8\beta)}. \tag{43}$$

Note that $f_4(1/2, \tau_2) \ge 1/2$ if $\tau_2 \le (15 - \sqrt{105})/30 \approx 0.1584$.

Consider now the overall error probability P, from Oil) . Assuming cr 2 < 1 , we set f3 < 1 / 2 . 
Then for r 2 ,r 3 satisfying conditions (1301) and r 2 < 1/3 we get from (ITT]) . (1001) . (1301) and (1351) 
as A\ —* 00 

— In Pi > min{ — In k = 0, 2, 3,4} + c^Ax) > 

> Ai(l - a 2 ) min ~ O 2 )} +o(A x ). 

We set $\beta = 1/2$. Then for any $(\sqrt{5/3} - 1)/2 \approx 0.1455 \le \tau_2 \le (15 - \sqrt{105})/30 \approx 0.1584$
and $(\sqrt{7/6} - 1)/2 \approx 0.04006 \le \tau_3 \le 1 - \tau_2$ we get as $A_1 \to \infty$ (i.e. as $n \to \infty$)

$$-\ln P_i \ge \frac{A_1(1 - \sigma^2)}{2} + o(n) = \frac{An(1 - \sigma^2)}{3} + o(n). \tag{44}$$


Since $P_e \le P_{e1}$ (see (11)) and $M = e^{o(n)}$, $n \to \infty$, then from (14), (21) and (44) we get

$$-\ln P_e \ge \frac{An(1 - \sigma^2)}{3} + o(n),$$

which completes the proof of the Theorem. □
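
The numerical windows for $\tau_2$ and $\tau_3$ at $\beta = 1/2$ can be reproduced with a short sketch. It assumes that the lower limits come from requiring each term under the minimum in (39) to be at least $1/3$, and that $f_4$ has the form $(3 + 4\beta)(3 + 4\beta - 6\tau_2 + 6\tau_2^2)/(6(3 + 8\beta))$; the function names are hypothetical.

```python
import math

def tau2_lower(beta=0.5):
    # Smallest tau2 with (1 + 2*tau2)^2 / (3 + 4*beta) >= 1/3.
    return (math.sqrt((3 + 4 * beta) / 3.0) - 1) / 2

def tau3_lower(beta=0.5):
    # Smallest tau3 with (1 + 2*tau3)^2 / (2 + 3*beta) >= 1/3.
    return (math.sqrt((2 + 3 * beta) / 3.0) - 1) / 2

def tau2_upper_at_half():
    # Largest tau2 with f4(1/2, tau2) >= 1/2.
    return (15 - math.sqrt(105.0)) / 30

def f4(beta, tau2):
    # Assumed form of the exponent coefficient for the case k = 4.
    return (3 + 4 * beta) * (3 + 4 * beta - 6 * tau2 + 6 * tau2 ** 2) / (6 * (3 + 8 * beta))
```

With $\beta = 1/2$ all four cases give the same leading coefficient $A_1/2$, and since $A_1 = nA/(1 + \beta) = 2nA/3$ this equals $nA/3$, in agreement with the Theorem.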